Measuring the accuracy of species distribution models: a review
نویسنده
چکیده
Species distribution models (SDMs) are empirical models relating species occurrence to environmental variables based on statistical or other response surfaces. Species distribution modeling can be used as a tool to solve many theoretical and applied ecological and environmental problems, which include testing biogeographical, ecological and evolutionary hypotheses, assessing species invasion and climate change impact, and supporting conservation planning and reserve selection. The utility of SDM in real world applications requires the knowledge of the model’s accuracy. The accuracy of a model includes two aspects: discrimination capacity and reliability. The former is the power of the model to differentiate presences from absences; and the latter refers to the capability of the predicted probabilities to reflect the observed proportion of sites occupied by the subject species. Similar methodology has been used for model accuracy assessment in different fields, including medical diagnostic test, weather forecasting and machine learning, etc. Some accuracy measures are used in all fields, e.g. the overall accuracy and the area under the receiver operating characteristic curve; while the use of other measures is largely restricted to specific fields, e.g. F-measure is mainly used in machine learning field, or is referred to by different names in different fields, e.g. “true skill statistic” is used in atmospheric science and it is called “Youden’s J” in medical diagnostic field. In this paper we review those accuracy measures typically used in ecology. Generally, the measures can be divided into two groups: threshold-dependent and thresholdindependent. Measures in the first group are used for binary predictions, and those in the second group are used for continuous predictions. Continuous predictions may be transformed to binary ones if a specific threshold is employed. In such cases, the threshold-dependent accuracy measures can also be used. The threshold-dependent indices used in or introduced to SDM field include overall accuracy, sensitivity, specificity, positive predictive value, negative predictive value, odds ratio, true skill statistic, F-measure, Cohen’s kappa, and normalized mutual information (NMI). However, since NMI only measures the agreement between two patterns, it cannot differentiate the worse-than-random models from the better-thanrandom models, which reduces its utility as an accuracy measure. The threshold-independent indices used in or introduced to the SDM field include the area under the receiver operating characteristic curve (AUC), Gini index, and point biserial correlation coefficient. The proportion of explained deviance 2 D and its adjusted form have been also introduced into SDM field. But this adjusted metric has no theoretical foundation in the context of generalized linear modeling. Therefore, we provide another adjusted form, which was proposed by H. V. Houwelingen based on the asymptotic 2 χ distribution of the log-likelihood statistics. Its superiority over other related measures has been found through previous simulation studies. We also provide another analogous measure, the coefficient of determination 2 R , which has had a long history in weather forecast verification and was also recommended for use in medical diagnosis. Though these measures 2 D and 2 R are routinely used to evaluate generalized linear models (GLMs), we argue that nothing prevents them from being applied to other GLM-like models. In SDM accuracy assessment, discrimination capacity is often considered, but model reliability is frequently ignored. The primary reason for this is that no reliability measure has been introduced into the ecological literature. To meet this need we also suggest that root mean square error be used as a reliability measure. Its squared form, mean square error, has been used in meteorology for a long time, and is called Brier’s score. We also discuss the effect of prevalence dependence of accuracy measures and the precision of accuracy estimates.
منابع مشابه
Modelling potential impacts of climate change on the oak spatial distribution (Case study: Ilam and Lorestan provinces)
Examining the effects of climate change on the oak spatial distribution, as the main species of Zagros forests and its ecological and economic values is of significant importance. Here, we used species distribution models for simulating current climatic suitability of oak and its potential changes in 2050 and 2070. For this purpose, five regression-based and machine learning approaches, four cl...
متن کاملPrediction of potential habitat distribution of Artemisia sieberi Besser using data-driven methods in Poshtkouh rangelands of Yazd province
The present study aimed to model potential habitat distribution of A. sieberi, and its ecological requirements using generalized additive model (GAM) and classification and regression tree (CART) in in the Poshtkouh rangelands of Yazd province. For this purpose, pure habitats of the species was delineated and the species presence data was recorded by the systematic-randomize sampling method. Us...
متن کاملPredicting the Potential Habitat Distribution of Crataegus Pontica C. Koch, Using a Combined Modeling Approach in Lorestan Province
Habitat degradation is one the important reasons of plant species extinction. Modeling techniques are widely used for identifying the potential habitats of different plant species. Thus, the purpose of current study was to determine potential habitats of Zalzalak in Lorestan Province. Species presence data and 23 environmental variables were collected in Lorestan Province. Correlation analysis ...
متن کاملModeling the current and future suitable habitat distribution of Fritillaria imperialis under climate change scenarios and using three general circulation model in Iran
Climate change may pose challenges to the conservation of plant species such as the Fritillaria imperialis that have narrow geographical distribution. In this study, the modeling suitable habitats of F.imperialis in Iran was done in the current condition and under climate change (2050). For this purpose, 78 species presence data along with 12 environmental variables including bioclimatic, physi...
متن کاملFitting of Species Frequency Distribution Models in Woody Societies in Rangelands of Gouradarah, Gechigiran Ahar
One of the important goals of natural resource management is to conserve plant diversity in the ecosystem so that sites are more productive and more ecologically stable at the various situations. Different diversity indices have been presented to study the species status of natural ecosystems. One of these indicators is the use of empirical distribution models that each of these models ind...
متن کاملEvaluation Approaches of Value at Risk for Tehran Stock Exchange
The purpose of this study is estimation of daily Value at Risk (VaR) for total index of Tehran Stock Exchange using parametric, nonparametric and semi-parametric approaches. Conditional and unconditional coverage backtesting are used for evaluating the accuracy of calculated VaR and also to compare the performance of mentioned approaches. In most cases, based on backtesting statistics Results, ...
متن کامل